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The theory of generalised measurements is used to examine the problem of discriminating un- 
ambiguously between non-orthogonal pure quantum states. Measurements of this type never give 
erroneous results, although, in general, there will be a non-zero probability of a result being incon- 
0^ ' elusive. It is shown that only linearly-independent states can be unambiguously discriminated. In 

addition to examining the general properties of such measurements, we discuss their application to 
entanglement concentration. 
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I. INTRODUCTION 



In classical mechanics, it is possible in principle to determine the state of any physical system. The state of a 
classical system is described by its canonical coordinates and momenta, which can be measured simultaneously to 
arbitrary precision. In quantum mechanics, however, the state of a system is represented as a vector in a Hilbert space, 
and is not itself an observable quantity. Precise determination of a completely unknown state vector is precluded by 
the nature of the quantum measurement process. It is only when the state belongs to a known orthogonal set that it 
can be infallibly determined, by a standard von Neumann measurement. 

When confronted with the problem of trying to discriminate between non-orthogonal states, we must accept that 
no strategy will correctly reveal the state of the system with unit probability. It is then of considerable theoretical 
and practical importance to find the optimum strategy, that is, the one with the highest probability of giving the 
correct result. This may be regarded as the central problem of quantum detection theory, which was established in 
the 1960s and 70s through the seminal work of Helstrom[l] and Holevo[2]. An indispensable tool here is the theory 
of generalised measurements or quantum operations^ , which lays down the necessary and, in principle, sufficient 
conditions for the physical realisability of quantum state transformations. The optimum detector problem consists 
of finding the quantum operation which gives the minimum average error probability for a known ensemble of states 
whose a priori probabilities are also given. 

More recently, it has been realised that another, and in some circumstances, potentially more useful approach to 
discriminating between non-orthogonal states is possible. Ivanovic[4] demonstrated the existence of a detector which 
is able to discriminate unambiguously between a pair of non-orthogonal pure states, that is, with zero error probability. 
This measurement does not contravene the laws of quantum mechanics since, for non-orthogonal states, it has less 
than unit probability of giving an answer at all. There will be a non-zero probability that the measurement returns an 
inconclusive result. The Ivanovic measurement has subsequently been streamlined and investigated in greater depth 
by Dieks[5] and Peres[6]. The Ivanovic-Dieks-Peres (IDP) measurement is a generalised measurement involving a 
unitary interaction with a two-state ancilla particle. A von Neumann measurement is then performed on the latter. 
This has two outcomes. One outcome maps both states of the particle of interest onto orthogonal states, which 
can be unambiguously discriminated. The other outcome causes them to be mapped onto the same state, giving 
an inconclusive result and completely erasing the information we wish to obtain. The IDP measurement gives the 
lowest average probability of obtaining inconclusive results for unambiguous measurements when both states, which 
we denote by \tp± >, have equal a priori probabilities of 1/2. The probability of the IDP measurement resulting in 
an inconclusive result is 

Pidp = | < i>+\i>- > |- (1.1) 

Measurements of this kind have been performed in the laboratory by Huttncr et al[7]. Weak pulses of light 
were prepared in non-orthogonal polarisation states, a fraction of which were converted into orthogonal ones by a 
loss mechanism. Applications of this type of measurement to eavesdropping on quantum cryptosystems have been 
discussed by Ekert et al[8] , and some of the consequences of carrying them out locally on part of an entangled system 
are discussed in [9]. It is, in fact, possible to attain a lower probability of inconclusive results than the Pidp limit 
provided we are prepared to allow for errors[10]. Given a fixed probability of inconclusive results Pi<Pidp, the 
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minimum error probability Pe saturates the inequality 

PePd>\{Pidp - Pif (1.2) 

where Pp = 1 — Pj — Pe is the successful discrimination probability. Setting Pj = gives the well-known Helstrom 
bound[l] Pe>(1 — [1 — | < > | 2 ] 1 / 2 )/2, which corresponds to the highest probability of giving the correct result 

among all possible strategies. 

The IDP analysis has recently been extended by Jaeger and Shimony[ll] to the more general case of two states with 
unequal a priori probabilities. While this analysis exhausts the theoretical problem of optimising unambiguous state- 
discrimination measurements for a pair of pure states, little or no attention has been given to the general problem 
of error-free discrimination between multiple quantum states. This paper is concerned with the general theory of 
unambiguous state-discrimination. In section II, it is shown that the necessary and, by construction, sufficient 
condition for the existence of unambiguous measurement strategies for a given set of non-orthogonal pure states is 
that the states must be linearly-independent, and we explore the general properties of the appropriate measurement 
transformations. These measurements are found not to be of the standard von Neumann type, but are more general 
quantum operations. We then describe the variational problem for optimum measurements. In section III, we show 
that the results of Jaeger and Shimony follow from our general formalism when restricted to two states, and derive the 
minimum probability of inconclusive results for n states among a class of constrained measurements, the constraint 
being that all states have equal conditional probabilities of being identified. The optimum such measurement is 
used in section IV, where we extend our earlier discussion[9] of the relationship between the IDP measurement and 
entanglement concentration[12] to the n-state case. Here, we describe a method based on state-discrimination which 
converts a fraction of an ensemble of n-level imperfectly-entangled systems into maximally-entangled ones. 



II. UNAMBIGUOUS STATE DISCRIMINATION 



Consider a quantum system known to be in one of the non-orthogonal pure states \ipj >, where j — l,..,n. We 
denote by H the Hilbert space spanned by these states. We wish to design a quantum measurement whose result tells 
us which state the system was prepared in, and furthermore, which never gives errors. The non-orthogonality of the 
\ipj > means that no standard von Neumann measurement on H can fulfil these requirements, and we must look to 
generalised measurements for the solution to the problem. Any generalised measurement can be expressed in terms 
of a set of linear transformation operators Ak acting on H[3]. The only constraint on these operators is that they 
must satisfy the resolution of the identity 

E4^ = i. (2.i) 

k 

Each Ak corresponds to a distinct outcome of the operation. Taking the system to be prepared with the initial 
density operator p, the probability Pf, of the fcth outcome is TipAlAk- Correspondingly the density operator is 
transformed according to p—>Ak'pA\/Pk- Here, we are concerned with quantum operations which allow us to unam- 
biguously discriminate between the n states \ipj >, and see that such a measurement must have n outcomes signalling 
the detection of each of the states. We assume, for the sake of simplicity, that only one outcome of the operation 
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corresponds to the detection of each of the states, as this affects none of the conclusions that follow. An additional 
outcome is however required to allow for inconclusive results, the possibility of which is unavoidable if the measurement 
strategy is required to be error-free, as has been found in studies of the problem of discriminating between just two 
states[4-ll]. Thus, the state-discrimination measurement will be characterised by the n operators A k corresponding 
to the states and a further transformation operator Ai generating inconclusive results. These operators must satisfy 

A\A I + Y,AlA k = i. (2.2) 

k 

The fcth outcome should only arise if the initial state is \ip k >, implying the following constraint: 

<^\A\A k \^>=P j 8 jk . (2.3) 

We now proceed to show that this zero-errors condition can only be met if the \tpj > are linearly- independent. To 
accomplish this, let us attempt to express them as superpositions of each other by writing 

r 

and determine the constraints imposed upon the fj r by Eq. (2.3). Inserting this expression for > into Eq. (2.3), 
we find 

E fjr'fjr < A'\AlA k \^ r >= P 5 ]k . (2.5) 

The last factor on the left of Eq. (2.5) is easily simplified with the aid of the Cauchy-Schwarz inequality 

| < Vvl^IifclVv > | 2 < < il>r\A\A k \i) r >< Vv|i[ifc|VV' > (2.6) 
and Eq. (2.3), which together imply that 

< VV'l^jUfclVv >= P r 5 rr ,5 rk . (2.7) 

Substitution of this expression into Eq. (2.5) then gives \fj r \ 2 — djr, which implies that none of the states can be 
a linear superposition of the others. They must be linearly- independent. 

We can easily establish the form of the A k using the constraint Eq. (2.7). One useful consequence of Eq. (2.7) is 
that the states A k \ip k > are orthogonal, so that we may write 

ifcl^fe >=P k 1/2 \^ k > (2.8) 

where the \4> k > form an orthonormal basis for H. A further useful property of A k which follows from Eq. (2.7) is that 
it annihilates the subspacc spanned by all \ipj > for j^k. Let us denote this subspace as H k , whose one-dimensional 
orthogonal complement we take to be spanned by the state >. From these two observations, we can infer that 
A k has the form 

Ak = Tm^^ >< ^ (2 ' 9) 

It is worth remarking that the states \tp k > are also linearly- independent. If there was a superposition J2 k a k\i ! k > 
which was equal to the zero vector, then acting upon this state with the operator J2 k l^fc >< V'fel would give 
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J2k a k < V'felV'fe > \i>k >— 0, which is impossible since the \tpk > are linearly-independent. Note that the < V'fclV'fc > 
factors are all non-zero due to the linear- independence of the \ipk >■ 

We have assumed that the states \ipk > span the Hilbert space upon which the transformation operators act. If, 
however, we examine a situation where the Hilbert space of the system is larger than the space TC spanned by the 
\ipk > ; a more general form of the Ak is permitted. We briefly discuss these more general measurements in the 
Appendix, and show that there always exists a measurement restricted to TL which is optimal, in the sense that it 
maximises the average success probability. 

Let us examine the problem of maximising the average success probability. We denote by r\j the a priori probability 
that the system was prepared in the state >, whereupon we find that the probability of successfully determining 
the state of the system is 

P D {\il>j >, Vj ) = J2vj < 4>M)A 3 \^ 3 >=Y^vjPj- (2-10) 

3 3 

Thus, we interpret Pj as the conditional probability that, given the system was prepared in the state \tpj >, this state 
will be identified. 

This probability is constrained by the positivity of the operator AjAr which, together with the decomposition of 
unity in Eq. (2.2), is equivalent to saying that none of the eigenvalues of the positive operator y^^AjA-j are greater 
than 1. Thus, we can state the variational problem whose solution is the maximum value of Pu{\ipj >,T)j) in the 
following way. Let us define the probability operator 

n({^}) = E i ] i ^ = >< V>,H (2.ii) 

3 3 

where we have let 

q 3 = \<^Hj>\- 2 . (2.12) 

We also denote by \{{Pj}) the maximum eigenvalue of Il({Pj}). Then we wish to vary the Pj so as to obtain the 
maximum value of Pod^j >,Vj) subject to the constraint that \{{Pj})<l- 

We can prove that the average discrimination probability is equal to 1 only when the states \ipj > are orthogonal. 
If Pi){\ipj >,i]j) — 1, then it is clear from Eq. (2.10) that the conditional discrimination probabilities Pj must also 
be 1. Assuming that this is the case, let us consider taking the expectation value of Tl({Pj = 1}) for an arbitrary 
quantum state > and determine the conditions under which its maximum attainable value is 1. Due to the 
linear-independence of the \ipj > and the fact that they span the Hilbert space, we can expand any state as a unique 
superposition of them, 

|V>=^- 1/2 E C ^>> (2-13) 

3 

such that \cj\ 2 — 1 and where the normalisation factor TV is 

N = J2 C *3'^<^3>- (2-14) 

3,3' 

Using the observation that < ipp\ipj > is non-zero only when j = j', we find that 
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< xP\tl({P 3 = >= N- 1 . (2.15) 

Therefore, we seek the conditions under which the normalisation factor N is not less than 1 for any state. This can 
be accomplished by defining the Hermitian operator 

W = J2<iP J ,\iP j >\w j , >< Wj \, (2.16) 

3,3' 

where the \wj > form an orthonormal basis for the Hilbert space. Corresponding to every state \ip > given by Eq. 
(2.13), there is a unique state \ip >— J2j c j\ w j >; an d the expectation value of W for this state is TV. This allows us 
to show that the necessary and sufficient condition for N>1 is that the \ipj > must be orthogonal, by the following 
argument. The trace of W is equal to n, the dimension of the Hilbert space, implying that its average eigenvalue is 
1. Therefore, if the eigenvalues are unequal, some must be less than unity, as must TV for some states \tp >. This can 
be averted only if all of the eigenvalues are equal to 1, implying that W is identity operator, which clearly, from Eq. 
(2.16) would require the \tpj > to be orthonormal. Therefore, unambiguous state-discrimination measurements have 
unit probability of succeeding only when the states are orthogonal. 

The maximum value of Pu(\tpj >,rjj) will always be obtained for Pj such that \({Pj}) = 1. Were this not the 
case, we could multiply all Pj by an equal factor bringing \{{Pj}) up to 1, with a corresponding increase in the value 
of Podtpj >,Vj)- O nc important consequence of this is that, for optimum measurements, when inconclusive results 
are obtained, the set of states \ipj > is mapped onto a linearly-dependent set. The proof of this is simple: given that 
li({Pj}) has an eigenvalue 1, the subspace spanned by the corresponding eigenvectors is, as a consequence of Eq. 
(2.2), the kernel of A\Aj. Consider a state \<j> > lying in this kernel. We can expand this state as \<j) >— J2j a j\tPj >■ 
Normalisation is unimportant here. If we also write Aj\ipj >= \ipj >, then 

I>|$>=0. (2.17) 

3 

Thus, the (unnormalised) states \ip'j > are linearly-dependent. From our proof that only linearly-independent states 
can be unambiguously discriminated, we see that for the maximum value of Podipj >,Vj)> when an inconclusive result 
is obtained, one cannot make a further attempt to determine the state of the system unless one is prepared to use a 
strategy which has a non-zero error probability. In the particular case of two states, they are mapped onto the same 
state, rendering any kind of further attempt impossible. 

The general solution to the variational problem for arbitrary states \ipj > and a priori probabilities rjj is presently 
unknown. It is doubtful that a closed form can be found for the general maximum value of the average discrimination 
probability Pu(\ipj >,rjj). However, certain special cases can be treated analytically. One is the problem for just 
two states, for which the variational problem can be solved exactly. In the following section we show that the results 
of Jaeger and Shimony[ll] can be derived from our general formalism. We also determine the optimum value of 
Pd(\iI>3 >i 7 lj) m the case of n states with the additional constraint that all Pj are equal. 

III. EXAMPLES 

To find the maximum value of Pu{\ij)j >,rjj) for a pair of non-orthogonal pure states, it is convenient to denote 
them by \ip± >, and to exploit the fact that they can always be represented as 
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|V>± >= cos6»|+ > ±sin0|- >, (3.1) 

where the angle 9 lies in the range [0, 7r/4] and the states |± > constitute an orthogonal basis for the space spanned 
by \ip± >• The system may be represented as a spin-1/2 particle, and |± > taken to be the eigenstates of a z with 
eigenvalues ±1. We find that 

|V>± >= sin<9|+ > ±cos0|- >, (3.2) 

where < V'+IV'- >=< i>-\ip+ >— and < tp±\ip± >= sin2#. Given that the states \ip± > have respective a priori 
probabilities r/±, we wish to find the maximum value of the discrimination probability PpQipj >,r)j) given by Eq. 
(2.10), subject to the constraint that the maximum eigenvalue of the operator 

n({^}) = —^-{(P+ + P-W - ^ cos2d ) + ( p + - P-)^sm29} (3.3) 
2sm 29 

is equal to 1. The eigenvalues of this operator are easily determined, and we find that the greater of them is equal to 
1 only when 

(1 - P+)(l - P_) = | <V+IV>- >\\ (3-4) 

where we have used cos20 = | < ip+\ip- > |. Eq. (3.4) is precisely the constraint obtained by Jaeger and Shimony[ll] 
when considering unambiguous state-discrimination using a unitary interaction with a two-state ancilla followed by 
a von Neumann measurement. The remainder of their analysis follows from maximising the average discrimination 
probability with respect to the constraint in Eq. (3.4) for 0<P±<1. Given that it is possible, without loss of generality, 
to take rj + >r?_ , the Jaeger-Shimony minimum probability Pjg of an inconclusive result is 

Pjs = 2(r7+?7_) 1/2 | < V+IV'- > I (3-5) 

when (??-/?7+) 1 / 2 >| < > | and 

Pjs=V+\ <lM^- > \ 2 + V- (3-6) 

whenever (f7-/f?+) 1//2 <| < ip+\ip- > |. In this latter case, the optimum measurement is of the von Neumann type 
where A + = \ip+ >< il>+\,Ai = \tjj- >< ip-\ and A- = 0. We see that the state > is never detected. Thus, for 
sufficiently large values of the overlap | < ip + \ip^ > | or the a priori probablity r/ + of the |i/> + > state, the optimum 
unambiguous measurement will consist of detecting only this state, and this occurs when the previously unknown 
state of the system is projected onto >, the state orthogonal to \ip- >. 

One further special case which is exactly soluble is that of n states subject to the constraint that all Pj are equal 
to some constant P. It follows from Eq. (2.10) that the average discrimination probability Po(|?/>j > 1 i]j) is also equal 
to P, whose value can be deduced from our knowledge that the largest eigenvalue of tl{{Pj = P)} must be 1. This 
simply implies that 

PdW, >,Vj) =P= (max^-l < > I 2 )" 1 , (3.7) 

3 

that is, the optimal discrimination probability is simply the reciprocal of the maximum eigenvalue of the operator 
J2j Ijli'j' >< V'/ 1 ■ We shall make use of this measurement in the next section, where we show how state-discrimination 
measurements, when performed locally on part of a larger, imperfectly entangled system, can leave the system in a 
maximally- entangled state. 
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IV. STATE DISCRIMINATION AND ENTANGLEMENT CONCENTRATION 



In this section, we examine the effect of carrying out state-discrimination measurements on a subsystem which 
is part of a larger system in an imperfectly entangled pure state. We generalise to the case of n-level systems our 
earlier demonstration [9] that the IDP measurement, which discriminates between a pair of non-orthogonal states, can 
serve as a basis for entanglement concentration, that is, can convert a fraction of a number of imperfectly entangled 
two-state systems into maximally-entangled ones such as singlets. In this special case, our method is equivalent to 
the 'Procrustean' technique proposed by Bennett et al[12]. 

Let us begin by considering a pair of n-level quantum systems. Our analysis can easily be generalised to systems 
with Hilbert spaces of different dimension. Giving our subsystems the labels 1 and 2, the state space H of the 
composite is the tensor product Hi®H2 of their individual Hilbert spaces. Any pure state \<f> > in H can be written 
as a Schmidt decomposition[13] 

n-l 

\<t>>=^ c o\ a j >®\^ >' ( 4>1 ) 

1=0 

that is, as a single sum where the \otj > and \(3j > arc orthonormal bases for Hi and H2 respectively. Note that 
in this section, we have adopted the convention of letting the index j run from to n — 1, which turns out to be 
more convenient in what follows. The state is normalised so that Y]j \cj\ 2 — 1. We assume that all of the Cj arc 
non-zero. Maximally-entangled states are those for which all of the \cj\ are equal to 1/n. Let us now imagine that we 
have a large number of composite systems prepared in the state \<j> >, and show how a modified state-discrimination 
measurement of the type described in sections II and III, performed locally on one part of each entangled pair, can 
transform a fraction of these systems into maximally-entangled states. 

To do this, we must express the state \(j> > in a more appropriate form. Corresponding to the basis \ctj > for Hi, 
we can define a conjugate basis 

-2m jk 



n 



(4.2) 



The basis \jk > is canonically conjugate to the \ctj > in the sense discussed by Kraus[14] and Pegg et a/[15]. The 
orthonormality of the \jk > is a consequence of the identity 

-2mj(k- k')~ 



- Vexp 

71 



= Skk> 



(4.3) 



It follows that the state \<j> > can be rewritten as 

where the new states \ipk > for the second system are given by 

\tpk >= £ c J exp 



2irijk 



n 



k > 



\Pi > ■ 



(4.4) 



(4.5) 



These states are normalised, although they are not orthogonal; if they were, \4> > would be maximally-entangled. 
They are, however, linearly- independent. To show this, let us look at the consquences of supposing that 
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^a fe |Vfe>=0 (4.6) 



k 

for some coefficients ak- Substituting into this sum the expansion in Eq. (4.5), we find that the condition 

E2wijk 
a fe exp 
n 

k 



= (4.7) 



must be satisfied for all j. Multiplication of Eq. (4.7) throughout by e summing over j and making further 

use of Eq. (4.3) gives ay = 0. Thus, the only way that Eq. (4.6) can be true is if all coefficients in this sum are zero, 
so the | ■(/>£, > arc linearly-independent. 

In Eq. (4.4), \(f> > has equal coefficients, so it is the non-orthogonality of the |V>fc > that prevents this state from 
being maximally-entangled. However, recall that a measurement which unambiguously discriminates between these 
states does so by mapping them onto orthogonal states. Having established their linear-independence, we see that 
such an operation is possible for \<p >. 

The state-discrimination measurement of section II must, however, be adapted somewhat to achieve our present 
aim; the transformation operators Ak defined in Eq. (2.9) would convert \cf> > into a mixed state. A way to avoid 
this comes from viewing the state-discrimination measurement as a two-stage procedure. First, the states \ipk > arc 
mapped onto the orthogonal states \(pk >. A von Neumann measurement is then carried out which projects onto this 
latter basis. Entanglement concentration requires only the first stage of this operation to be carried out, so let us 
define the orthogonalisation operator 

pl/2 

' 7,, 



This operator performs the required mapping of the \ipk > onto the orthogonal \<pk >. That state-discrimination can 
be regarded as this operation followed by a von-Neumann measurement is easily seen from the fact that Ak — \(f>k >< 
4>k\Ao- Note that we also have 

A t Ao=J2 A l A k = i - A \Ai, (4.9) 
fe 

where Ai is the operator introduced in Eq. (2.2). In state-discrimination attempts, this operator generates inconclusive 
results. In the present context of entanglement concentration, its action results in a failure to produce the desired 
maximally-entangled state. 

The state given by ^4o|0 >> when normalised, is a standard maximally- entangled state when the Pk in Eq. (4.8) 
are set to the same value, P. The probability Pc of successful conversion of |<^> > into a maximally-entangled state is 

P c =< 4>\ti A \<l) >= P, (4.10) 

that is, the probability of transforming \<j> > into a maximally-entangled state using our method is simply the probabil- 
ity of successfully discriminating between the states \ipk > when we require the conditional discrimination probabilitcs 
of these states to be equal. We gave the formal solution, Eq. (3.7), to the problem of finding the maximum value of P, 
and hence Pc in the preceding section. However, it turns out that for \tpk > given by Eq. (4.5), a particularly simple 
expression for P can be obtained. We wish to evaluate the reciprocal of the maximum eigenvalue of the operator 
Sfe9 fc l^fc ><; V'fe I- The states \ipj- > are found to be 
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|^>=7V- 1 /^ c *-i ex p 



3 

-2 



2m jk 



\0j >, (4.11) 



where the normalisation factor N = J2j \ c j\ an d the qj, which we introduced in Eq. (2.13), are 

N 

gj = - 2 . (4.12) 
Collecting these results together, we find, with the aid of Eq. (4.3), that 

5> fc |^ xV'fel = jEl^" J lft ><fti ( 4 - 13 ) 

k j 

Since the |/3j > are orthonormal, the eigenvalues of this operator are simply read off to be l/n\cj\ 2 . Thus, the 
maximum probability of converting \ip > into a maximally-entangled state using our concentration prodedure is 

P c = nxmin(| Cj | 2 ). (4.14) 

It is interesting to see what happens when the entanglement concentration attempt fails. This naturally involves 
examining the operator Aj, which is constrained by Ao and Eq. (4.9) to have the form Aj = U\l — A^Ao] 1 / 2 , 
where U is any unitary operator acting on Ti.2- This operator annihilates all terms \ctj > ®\(3j > in the Schmidt 
decomposition Eq. (4.1) which correspond to the smallest of the \cj\ 2 . Therefore, a failure to produce the desired 
maximally entangled state will remove the possibility of the subsystems 1 and 2 being found in the states \aj > and 
\Pj >, and thus their correlation. In the case of just two states, the initial entangled state becomes a product state. 



V. SUMMARY 



In this paper, we have considered the problem of unambiguous state-discrimination. Our particular aims were to 
determine the general conditions under which this is possible, and the form of the appropriate measurement operators. 
As we have seen, one can discriminate among a set of pure states \ipj > if and only if they are linearly-independent. 
The measurement itself consists of mapping these states onto an orthogonal set \<f)j >, followed by a von Neumann 
measurement in this basis. 

Only orthogonal states can be unambiguously discriminated with unit probability. In general, there will be a non- 
zero probability of obtaining inconclusive results. It is clearly of interest to study the conditions under which the 
average discrimination probability Pjj(\tpj >,T]j) reaches its maximum value for a given set of states \tpj > with a 
priori probabilities rjj. We have formulated the appropriate variational problem, and shown that its solution for just 
two states is the same as that obtained by Jaeger and Shimony[ll] using a specific measurement model. We have 
also found the maximum discrimination probability for n states with the additional constraint that all Pj are equal. 
Clearly, however, the general problem deserves further study. An important goal of future work is the determination of 
the maximum discrimination probability for cases other than those considered here. A still more challenging problem 
is to devise practical techniques for accurately preparing a given quantum system in one of many non-orthogonal pure 
states, and realising the appropriate transformations. 

As an application of the general formalism of sections II and III, we showed in section IV how state-discrimination 
may be related to entanglement concentration. It is well known that any inseperable pure state with two subsystems 
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can be expressed as a biorthogonal expansion known as the Schmidt decomposition, Eq. (4.1). If the state is not 
maximally-entangled, the coefficients are not equal, but can be made so if we choose to express one of the subsystems 
in terms of non-orthogonal, but nevertheless linearly-independent states. The first stage of the state-discrimination 
operation, namely the mapping of these states onto an orthogonal basis, can then be used to produce a maximally- 
entangled state, by acting only on one of the subsystems. However, as with state-discrimination, this process is not 
guaranteed to be successful, and we run the risk of degrading rather than enhancing the correlations when it fails. 

APPENDIX 

Here, we give a brief discussion of more general unambiguous state-discrimination measurements where the Hilbert 
space H spanned by the \ipj > is a proper subspace of the space spanned by the possible states of the system. We 
shall denote this larger space by Hs- This is the direct product Hs — H®H , where H is the orthogonal complement 
of H within Hs- A generalisation of the measurement described in section II is possible under these circumstances 
since there exists more than one state in Hs which has a non-zero overlap with the state \ipj > and is orthogonal 
to all |V>fc > for k^=j. For measurements restricted to H, these states are uniquely, up to a phase, given by \ipj- >. 
However, in the more general case we are considering here, any state of the form 

Wsj >=N\^f > +"j\Xj > (A.l) 

may be used, where the \\j > are arbitrary normalised states in H and |^j| 2 + \vj\ 2 = 1. The subscript S will be 
used in what follows for states and operators in the enlarged space Hs- Clearly, 

<4>k\Xi >=<ti\Xj >=0. (A.2) 

The discussion in section II leading to the construction of the transformation operators Aj, given by Eq. (2.9), can 
easily be repeated in this broader setting. We find that the corresponding generalised transformation operators Asj 
have the form 

The \<psj > can be any n orthonormal states in Hs- As is the case with measurements restricted to H, the sum 
J2j AgjAsj does not equal the identity operator, so we are obliged to introduce an operator Asi generating incon- 
clusive results which satisfies 

4i4w + E4Ai = is, ( A -4) 

3 

where Is is, of course, the identity on Hs- The probabilities Pj are restricted by the constraint that the maximum 
eigenvalue Xs({Pj}) of the probability operator 

n s ({Pj}) = E = E >< ( A - 5 ) 

3 3 

must not exceed 1. For optimum measurements, giving the maximum value of Podipj >, we find that As({P,}) = 
1. This can be established by an argument similar to that given in section II which showed that \({Pj}) = 1 for 
optimum measurements confined to H 
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We shall now prove that, for given states \ipj > and a priori probabilities rjj, the maximum value of the average 
discrimination probability PpQipj >,r]j) can always be obtained by considering measurements which are restricted to 
the space H. We begin with the observation that, for any values of the probabilities Pj and any state \ip > in H, we 
have 

< m{{p 3 })\i> >=< ms({Pj}M > • (a.6) 

The support of n({Pj}) is H, so we are entitled to apply Eq. (A.6) when \ip > is an eigenstate of Tl({Pj}) 
corresponding to its maximum eigenvalue \({Pj}). If \ip > is also an eigenstate of fls({Pj}) with corresponding 
maximum eigenvalue Xs({Pj}), then both of these extremal eigenvalues are seen from Eq. (A.6) to be equal. If not, 
however, then (A.6) implies that As({Pj}) must exceed A({P,}), giving the general inequality 

As({P J -})>A({P,-}) (A.7) 

We can use this result to show how the assumption that the maximum value of PpQipj >,T]j) cannot be attained 
for measurements restricted to TL leads to a contradiction. If the Pj correspond to the optimum measurement, 
giving the maximum value of PuQipj >, rjj) in Eq. (2.10), and that this measurement cannot be restricted to H., then 
Xs({Pj}) = 1. This implies that a restricted measurement with the same probabilities Pj would give A({P,})<1. Thus, 
we can consider another restricted measurement with probabilities P- = P,/A({P,}) such that Pj>Pj, giving at least 
as high a value for the average discrimination probability. The maximum eigenvalue A({P-}) °f tnc corresponding 
probability operator II({P.}) is 1, so this measurement is allowed. Therefore, for any measurement on the larger 
Hilbert space Hs, we can construct a measurement restricted to H which is at least as good, contradicting the 
premise. 
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